A  REVIEW  AND  SOME  EXTENSIONS  OF 
TAKEMURA'S  GENERALIZATIONS  OF  COCHRAN’S  THEOREM 


TECHNICAL  REPORT  NO.  2 


GEORGE  P.J.  STYAN 


SEPTEMBER  1982 


U.  S.  ARMY  RESEARCH  OFFICE 
CONTRACT  DAAG29-82-K-0156 


DEPARTMENT  OF  STATISTICS 
STANFORD  UNIVERSITY 
STANFORD,  CALIFORNIA 


APPROVED  FOR  PUBLIC  RELEASE;  DISTRIBUTION  UNLIMITED 


A  REVIEW  AND  SOME  EXTENSIONS  OF 
TAKEMURA'S  GENERALIZATIONS  OF  COCHRAN’S  THEOREM 


TECHNICAL  REPORT  NO.  2 


GEORGE  P.J.  STYAN 
McGILL  UNIVERSITY 


SEPTEMBER  1982 


U.  S.  ARMY  RESEARCH  OFFICE 

CONTRACT  DAAG29-82-K-0156 

Also  issued  as  Technical  Report  No.  56  under 
Office  of  Naval  Research  Contract  No.  N00014-75-C-0442, 
Department  of  Statistics,  Stanford  University. 


DEPARTMENT  OF  STATISTICS 
STANFORD  UNIVERSITY 
STANFORD,  CALIFORNIA 


APPROVED  FOR  PUBLIC  RELEASE;  DISTRIBUTION  UNLIMITED 


THE  VIEW,  OPINIONS,  AND/OR  FINDINGS  CONTAINED  IN  THIS  REPORT  ARE  THOSE  OF 
THE  AUTHOR(S)  AND  SHOULD  NOT  BE  CONSTRUED  AS  AN  OFFICIAL  DEPARTMENT  OF  THE 
ARMY  POSITION,  POLICY,  OR  DECISION,  UNLESS  SO  DESIGNATED  BY  OTHER  DOCUMEN¬ 
TATION. 


A  REVIEW  AND  SOME  EXTENSIONS  OF 


TAKEMURA'S  GENERALIZATIONS  OF  COCHRAN’S  THEOREM 

by 

George  P.H.  Styan 
McGill  University 

This  is  a  review  of  the  Stanford  Technical  Report  No. 44  by  Akimichi 
Takemura  "On  Generalizations  of  Cochran's  Theorem  and  Projection  Matrices" 
(August  1980).  A  number  of  extensions  are  also  presented.  In  Section  1 
we  review  the  notions  of  direct  sum  of  matrices  and  of  vector  spaces,  and 
show  the  equivalence  of  several  different  definitions.  We  also  examine  Take¬ 
mura  's  "independent"  vector  spaces  and  find  that  they  coincide  with  the 
"disjoint"  vector  spaces  considered  by  Rao  and  Yanai  (1979).  The  connection 
of  this  concept  with  rank  additivity  is  explored  in  Section  2.  In  Sections 
3  and  4  we  elaborate  and  extend  several  results  of  Takemura' s  on  rank  additivity 
and  r-potent  matrices  and  matrix  polynomials.  These  results  build  on  those 
given  by  Anderson  and  Styan  (1980)  . 


1 .  Direct  Sums,  Virtual  Disjointness  and  Independence. 


Let  the  vector  spaces  U ,  ..., 


column  vectors  and  let  the  matrices 


be  subspaces  of  a  vector  space  X  of 
. ..,  A^  be  defined  by 


mxl 


(1.1) 


C(V  =  V  1  =  l>  •••»  k> 


where  C(.)  denotes  column  space  (range).  The  matrices  A^, 
same  number  of  rows:  m.  Let  A.  be  mxn.  and  write  n  = 

l  l 


yk 

E  n.  . 
1  i 


all  have  the 
Let 


2. 


(1.2) 


be  the  km  *n  block -diagonal  matrix  of  the  A.'s. 

The  matrix  D  is  defined  as  the  dOizct  6 um  o£  the  ma&u.ceA  A  ,  . .  . ,  A 

1  k 

(cf.  Marcus  and  Mine,  1964,  pages  5-6)  .  This  definition  is  also  valid  when  the 
A.  do  not  all  have  the  same  number  of  rows.  We  may  write 

l 


(1.3) 


D  =  A  ®  ...  ®  A^  ■ 


Let  the  kmxm  partitioned  matrix 


(1.4) 


K  =  (I  ,  I  )', 

m  m 


cf.  Anderson  and  Styan  (1980,  p.ll).  Then  the  m*n  partitioned  matrix 


(1.5) 


B  =  (Ax,  ...,  Ak)  =  K'D 


spans  the  union  of  the  vector  spaces  U  ,  ...,  U  , 

1  Jc 


(1.6) 


U  =  U  u  . . .  u  Uk  =  C(A1,  ...,  A^) 


C(B)  . 


We  define  the  matrices  A  ,  . . . ,  A  and  the  vector  spaces  U  ,  ,  U 

lk  lk 

to  be  mutwxlZy  VAJituatty  dujoZnt  whenever 


3. 


(1.7)  r(A1,  . ..,  A^  =  zJr(A.), 

where  r(.)  denotes  rank,  cf.  Styan  (1981).  We  may  write  (1.7)  as 

(1.8)  r(K'D)  =  r(D) 


or  equivalently  as 

(1.9)  dim(U  u  ...  u  U  )  =  E^dirnU. , 

1  K.  1  1 


where  dim(.)  denotes  dimension  of  the  vector  space. 


When  the  vector  spaces  U^,  . ..,  U ^  are  mutuatly  vVitiiaULy  disjoint  then 
we  define  their  union  as  the  diA&ct  6um  o£  £k&  v&cXosl  6pa.c.eJ>  U^,  . ..,  U^,  and 
we  write 


(1.10)  U  ©  ...  ©  =  U  u  ...  u  U  =  C(A1,  . ..,  A^  =  C(K'D). 

Rao  and  Mitra  (1971,  page  3,  lines  5-6)  define  the  two  vector  spaces  U and 
U.  to  be  vVvtiuxZZy  disjoint  whenever  their  intersection 

(1.11)  II.  n  II.  =  {0}  , 

i  j 

the  null  vector  only,  or  equivalently 

r(Ai,  A  )  =  r(Ai)  +  r(A^) , 


(1.12) 


4. 

cf.  Marsaglia  and  Styan  (1974,  page  272  (2.19)).  Clearly  (1.7)  =*  (1.12)  for 
all  i  f  j  and  so  (1.9)  =>  (1.11)  for  all  i  ^  j.  The  converse,  however,  holds 
in  general  only  for  k  =  2;  if  k  =  3  and 

(i.is,  a ,-(}).  a2=  (j), 

then  (1.12)  holds  for  all  i  i  j  but  (1.7)  does  not. 

Rao  and  Yanai  (1979,  page  2,  section  2,  lines  4-5)  use  (1.11)  to  define 
U.  and  U.  "disjoint"  and  then  (page  2,  Definition  1)  define  U  ...,  U  as 

1  J  1  K 

"disjoint"  whenever 


(1.14)  u  €  U  u  ...  u  Uk  =  C(A1,  ...,  A^)  =  C(B)  =  C(K'D) 


has  the  unique  representation 


(1.15) 


u 


u.  e  U. ,  i  =  1, . . . ,k. 
i  i 


Rao  (1973,  page  11  (vii))  uses  this  as  a  definition  of  "direct  sum";  see  also 
Takemura  (1980,  page  2,  lines  -6  and  -7). 

To  see  that  (1.15)  follows  from  (1.7)  we  write 


(1.16) 


u  =  Bx 


(1) 


=  K'Dx 


(1) 


=  Bx. 


(2) 


=  K'Dx 


(2) 


Then 


(1.17) 


K'Dx*-1-*  =  K'Dx*-2-* 


y 


5. 


which  follows  at  once  from  (1.8)  and  the  left-hand  rank  cancellation  rule 
(Anderson  and  Styan  (1980,  page  12,  Lemma  2.2)). 

A  connection  between  the  definitions  of  matrix  direct  stun  and  vector-space 
direct  sum  is  provided  by  the  equality  of  the  row  spaces  of  l*n  row  vectors 


(1.18) 


R(K'D)  =  R(D) 


or  equivalently 


(1.19) 


R(AX,  ...,  Afc)  =  R/A1 


R(A  ©  . . .  ©  A^) 


This  follows  from  (1.8)  since  R(K'D)  C  R(D)  always  holds. 

There  seems  to  be  no  direct  connection,  however,  between  the  vector-space 
direct  sum  ©  . .  .  ©  =  C(A  ,  ...,  A^)  of  m*l  vectors  and  the  column 

space  of  kmxl  vectors  of  the  matrix  direct  sum  C(A^  ©  ...  ©  A_  )  . 

The  "definition"  given  by  Takemura  (1980,  page  2,  paragraph  3,  line  3)  of 
(J, ,  . . . ,  ti,  being  (linearly)  independent  is  that  if 

X  K 


(1.20) 


u^  6  (J.,  i  =  1,. . .  ,k  and  ^u^  =  0  then  uj_  =  1  =  1*  •  •  •  »k. 


This  is  given  by  Rao  and  Yanai  (1979,  page  4)  as  the  "result"  (1):  that  if  A^, 
...,  A^  are  "disjoint"  then 


(1.21) 


2  A.x  =  0 

1  i  i 


A.x.  =0,  i  =  1 , . . . ,k. 

li 


6. 


We  may  write  (1.21)  as 

(1.22)  K '  Dx  =  0  <=»  Dx  =  0, 

which  follows  at  once  from  (1.8)  and  the  left-hand  rank  cancellation  rule  (cf. 
Anderson  and  Styan,  1980,  page  12,  Lemma  2.2). 

Rao  and  Yanai  (1979,  page  5,  Theorem  1)  prove  that  if 


(1.23) 


is  a  generalized  inverse  of  B  then 


(1.24)  A.Y.A.  =  A.,  i  =  1, . . . ,k  ,  and 

1111 

follow  if  and  only  if  the  A^  are  "disjoint". 


A.Y.A.  =0  for  all  i  /  j, 

l  l  j 


We  may  write  (1.23)  as 


(1.25)  K'DYB  =  K'D  =  B 


and  (1.24)  as 


(1.26)  DYB  =  D. 

When  the  A.  are  disjoint,  (1.8)  holds  and  so  (1.25)  =*  (1.26)  using  the  left-hand 
rank  cancellation  rule  (Anderson  and  Styan,  1980,  page  12,  Lemma  2.2).  Conversely 

(1.26)  implies  that  r(D)  £  r(B)  =  r(K'D)  <  r(D)  and  so  (1.8)  holds 


7. 


and  the  are  "disjoint". 

This  result  from  Rao  and  Yanai  (1979,  page  5,  Theorem  1)  closely  parallels 

2 

the  result  of  Takemura  (1980,  page  4,  Proposition  2.3):  that  if  P  =  P  and 


C(P)  =  U  =  U 


.  ©  with  U^,  ...,  "independent"  then  P  =  E  p. ,  with 


(1.27) 


P  =  P  ,  i  =  1, . . .  ,k,  and 


P.P. 
i  J 


0  for  all  i  ^  j. 


and  the  P.  are  unique.  The  correspondence  between  Takemura  and  Rao  §  Yanai 
is  found  by  setting  P^  =  A^Y^;  the  assertion  of  uniqueness  was  not  given  by  Rao 
§  Yanai. 

To  prove  Takemura' s  Proposition  2.3  let  us  write 

(1.28)  P  =  BY  =  E^A.Y. 

Ill 

for  some  generalized  inverse  Y  =  B  ,  cf.  (1.23),  and  Theorem  3  of  Marsaglia  and 

Styan  (1974,  page  273).  Setting  P  =  A^Y^  then  shows  that  (1.24)  =>  (1.27).  The 

converse  follows  from  r(A.)  =  dimti.  =  r(P.)  =  r(A.Y.)  and  the  right-hand  rank 

l  i  i  ii  6 

cancellation  rule  Lemma  2.1  in  Anderson  and  Styan  (1980,  page  12). 

To  see  that  the  P.  are  unique  we  write 


(1.29)  P  =  BZ,  where  Z  =  B 


not  necessarily  equal  to  Y.  We  may  do  this  in  view  of  Theorem  3  of  Marsaglia  and 
Styan  (1974,  page  273).  When  the  A^  are  "independent"  then 


(1.30) 


P  =  K'DY  =  K'DZ  =*  DY  =  DZ 


8. 


using  (1.8)  and  the  left-hand  rank  cancellation  rule,  and  so 


is  uniquely  determined. 


2 .  Projection  and  Idempotent  Matrices. 

2 

The  square  matrix  A  is  idempotent  whenever  A  =  A;  such  a  matrix  is 
called  a  "projection  matrix"  by  Takemura  (1980,  page  2).  An  interesting  dis¬ 
cussion  of  this  terminology  is  given  by  Ben- Israel  and  Greville  (1974,  page 
51,  footnote). 

Takemura' s  Proposition  2.1  closely  parallels  Corollary  11.2  of  Marsaglia 
and  Styan  (1974,  page  283) .  Since  the  null  space 

(2.1)  N(I  -  A)  =  (x  :  (I-A)x  =0}  =  (x  :  Ax  =  x} 

it  would  seem  more  appropriate  to  replace  (iii)  and  (iv) ,  respectively,  by 

(iii) •  C(A)  =  M(I-A) ,  and  (iv) ’  W(A)  =  C(I-A) . 

From  (1.7)  we  see  that 

(v)  C( A)  and  C(I-A)  are  "independent"  <=*  r(A,  I-A)  =  r(A)  +  r(I-A).  . . .  (v)  ' 


9. 


Then  it  is  obvious  that  (i)  <=*  (ii)  and  that  (iii)  '  <=*  (iv)  '  ,  while  (i)  «==*■  (v)  ' 
is  (5.15)  (5.21)  in  Corollary  11.2  of  Marsaglia  and  Styan  (1974,  page  283), 

and  (i)  <=*■  (iii)'  is  proved  by  Anderson  and  Styan  (1980,  pages  7-8,  section  2.1). 


Takemura's  Proposition  2.2  (page  4)  extends  Theorem  1.1  of  Anderson  and  Styan 


(1980,  page  5).  Let  A  ,  ...,  A  be  square  matrices,  not  necessarily  symmetric, 

JL  K. 

k 

and  let  A  =  E  A. .  Consider  the  following  statements 


(a) 

A2  =  A., 

l  l 

i  —  1, . . . ,k. 

(b) 

A. A  =  0 

for  all  i  ^  j. 

i  j 

(c) 

< 

II 

M 

< 

(d) 

^r(V  - 

r(A), 

(dl) 

r(V  ••• 

>  V  - 

Cd2) 

C(Ax,  ... 

V* 

V—/ 

II 

o 

> 

V - / 

Then  Takemura's  Proposition  2.2  is  that 


(a),  (b)  =*  (c),  (dl),  (d2) . 

In  their  (1.4),  Anderson  and  Styan  (1980,  pages  5  and  11A)  proved  that 

(a),  (b)  =>  (c),  (d). 


That  (d)  =*  (dl)  is  (6.1)  =*  (6.2)  in  Theorem  13  of  Marsaglia  and  Styan  (1974, 
pages  284-285).  That  (dl)  =* (d2)  follows  at  once  since  C(A  ,  ...,  A  )  always 

1.  K 

contains  C(A),  or  equivalently  C(K'D)  75  C(K'DK). 

Since  (a),  (b)  =*  (dl)  Takemura  (1980,  page  5,  lines  -8  to  -10)  suggests 
that  "it  may  well  be  justified  to  call  projection  matrices  A^,  i  =  l,...k. 


10. 


(linearly)  independent  if  A. A  =  0  for  all  i  ^  j."  I  think  that  this  is 
not  very  wise  since 

(a),  (dl)  ±  (b); 


i . e ., "independent"  projections  A  ,  ...»  A  are  not  necessarily  "pairwise 

1  K 

orthogonal":  A. A.  =  0  for  all  i  ^  j.  For  example  with  k  =  2,  let 


Takemura's  Proposition  2.3  (page  4)  has  been  discussed  already  in  this 
review  (see  page  7) . 

Takemura's  Proposition  2.4  (page  6)  is  well  known  and  can  easily  be  proved 
without  diagonal izability  (I  prefer  using  "diagonability"  and  will  do  so  below), 
cf.  e.g.,  Rao  (1973,  page  28  (i)  ). 


11. 


3.  Cochran-type  Theorems. 


Takemura's  Lemma  3.1  (page  7)  builds  on  his  Proposition  2.2  to  show  that 

(d)  ~  (dl),  (d2) . 

We  have  already  shown  that  (d)  (dl) ,  (d2) ,  cf .  page  9  of  this  review.  To 
go  the  other  way  let  us  write: 

(dl)  :  r(K'D)  =  r(D) 

(d2)  :  C(K'D)  =  C(K’DK) 

(d)  :  r(K'DK)  =  r(D) . 

Clearly  (d2)  =*  r(K’D)  =  r(K’DK)  and  so  (dl) ,  (d2)  =*  (d) . 

Takemura's  Theorem  3.2  (page  9)  builds  on  Theorem  3.3  of  Anderson  and  Styan 
(1980,  page  24),  but  does  not  seem  to  be  a  "simpler  (but  equivalent)  version".  Let 


us  write 


12. 


(a)  : 

r 

Ar  =  A., 

l  l 

i  = 

1,.. 

. ,  k , 

(b)  : 

A. A.  =  0 

for 

all 

i  /  j 

i  1 

(c)r  : 

*4 

n 

> 

w 

(d)  : 

S^r(Ai)  =  r(A), 

(e2)  : 

AA.  =  A. A, 

l  l 

i  = 

1,.. 

.  ,k. 

The  condition  (e2)  was  used  by  Anderson  and  Styan  (1980,  page  18,  Theorem  3.1). 
Takemura's  Theorem  3.2  then  shows  that 

(a)  ,  (b)  <-*  (c)  ,  (d) ,  (e2) . 
r  r 

That 

(a)  ,  (b)  =*  (c)  ,  (d) 
r  r 

was  proved  by  Anderson  and  Styan  (1980,  page  18,  Theorem  3.1),  while 

(b)  =»  (e2) 

always  holds.  To  go  the  other  way,  we  note  first  that 

(d),  (e2)  =>  (b). 


This  was  proved  for  symmetric  matrices  by  Luther  (1965,  page  684,  Theorem  1) 
and  for  square  matrices  not  necessarily  symmetric  by  Marsaglia  (1967,  page  461, 
Theorem  3);  see  also  Marsaglia  and  Styan  (1974,  page  286,  Theorem  15(11)).  Hence 


13. 


(c)  ,  (d) ,  (e2)  =*  (b) ,  (c)  -*  (a)  ,  (b) 

r  r  r 


using  (3.23)  on  page  25  of  Anderson  and  Styan  (1980).  In  their  Theorem  3.3 
Anderson  and  Styan  (1980,  page  24)  proved  that 


(a)  (b)  <=*  (c)  ,  (d),  (e2) 

r  r  r 

where 

(e2)  :  Ar_2A  =  A. Ar-2 ,  i  =  l,...,k. 

r  ii 


Since 


(e2)  ->  (e2)  and  (b)  (e2) 

r 


always  hold,  and  since 


(d) ,  (e2)  (b) 


(see  page  12  above)  it  seems  to  me  that  Takemura's  Theorem  3.2  is  equivalent  to  the 
Anderson  and  Styan  Theorem  3.3. 


14. 


4 .  Matrix  Polynomials. 

Takemura's  Theorem  3.3  (page  10)  extends  the  Anderson  and  Styan  Theorem  3.3 
by  replacing  their  (a)  and  (c)  conditions  by  a  matrix  polynomial  P(.)  of  degree 
at  least  2 .  Let  us  write 


fa’po  : 

P(A.)  =  0, 

i  =  1 ,  . . 

•  »k , 

a-*  PI 

Aip(Ai)  =  0, 

i  =  1 , . . 

.  ,k. 

(b)  : 

A. A.  =  0 

for  all 

i  f 

i  1 

(c)  : 

P(A)  =  0, 

(d)  : 

E^rCA.)  =  r(A), 

(e2)  : 

AA.  =  A. A, 

l  l 

i  =  1 , .  . 

.  ,k. 

Takemura's  Theorem  3.3  then  shows  that 


(<0p,  (d),  (e2)  =*  (a)  p,  (b) , 


where 


^PO 

if 

P(0)  =  0, 

(a)pl 

if 

P(0)  *  o. 

Since  (d)  ,  (e2)  =►  (b)  always  holds,  cf.  page  12  above,  Takemura's  Theorem  3.3 
reduces  to 


(b),  (c)p,  (d)  =*  (a)p . 


(4.1) 


15. 


We  strengthen  (4.1)  by  showing  that 

(4.2)  (b),  (c)p,  (d)  (a)J  , 

where 

(a)*  :  P^)  =  c(I  -  A“Ai) ,  i  =  l,...,k, 

P(O)  =  cl,  and  A  is  a  generalized  inverse  of  A.  Then  (a)*  =*  (a)pQ  if  c  =  0. 
To  prove  (4.2)  we  note  that  when  (b)  holds 

(4.3)  AP  =  Z?A? 

1  l 

for  any  positive  integer  p,  and  so 

(4.4)  P(A)  =  cl  +  zJ[P(A  )  -  cl). 

When  (d)  holds  we  may  apply  Theorem  1.2  of  Anderson  and  Styan  (1980,  page  6)  to 
find 

(4.5)  A.  A  A.  =  A.  ,  i  =  l,...,k,  and  A.  A~A.  =  0  for  all  i  j-  i . 

ill  i  J 

Postmultiplying  (4.4)  by  A  and  applying  (4.5)  yields 

(4.6)  [P(A)]A_Ai  =  cA~A^  +  [P(A  )  -  cl),  i  =  l,...,k, 

and  so  (c)p  (a)*.  Premultiplying  (a)*  by  A.  yields  (a)p^. 


16. 


Takemura's  Lemma  4.1  (page  14)  shows  that  if  P(x)  =  0  has  no  multiple  root 
and  if  (c)p:  PfA)  =  0  holds,  then  A  is  diagonable.  Since  (by  definition)  the 
minimal  polynomial  must  divide  P(A)  it  follows  that  when  P(A)  has  no  multiple 
root  then  neither  does  the  minimal  polynomial  $(A) .  This  characterizes  diagonability 
of  the  matrix  A  ,  cf.  Mirsky  (1955,  page  297,  Theorem  10.2.5),  and  so  the  con¬ 
dition  in  Lemma  4.1  does  not  seem  to  be  as  strong  as  Takemura  claims  on  page 
13  (line  -1)  . 

Takemura’s  Theorem  4.1  is  not  new.  It  contains  his  Lemma  4.1  and  the  de¬ 
composition  into  "independent  projections",  more  usually  known  as  "principal 
idempotents"  (cf.  Ben- Israel  and  Greville,  1974,  pages  52-55,  especially  Theorem  9 
and  Exercise  27) . 

x 

To  Takemura's  Corollary  4.1  should  be  added  £,H.  =  I. 

1  3 

Takemura's  Theorem  4.2  (pages  15-16)  considers  a  polynomial  P(x)  with  no 
multiple  root  and  then  extends  his  Theorem  3.3  to  show  that 


(c)  ,  (d),  (e2)  =>  A  =  I A  H 
F  j=lJ  ^ 


where  A,,  ...,  A„  are  the  nonzero  characteristic  roots  of  A  and  the  H. . 

1  5  l  ij 

are  "independent  projections"  (principal  idempotents).  In  fact  t  ,  =  0 

whenever  i  f  i'  and/or  j  /  j'  .  The  condition  that  P(x)  has  no  multiple 
root  assures  that  (c)  =►  A  diagonable.  Since  (d)  ,  (e2)  =s>  (b)  we  see  the 
interesting  result  (essentially  Takemura's  Theorem  4.4,  page  21)  that 


(4.7)  A  diagonable,  (b) ,  (d)  =*  A_^  diagonable,  i  =  l,...,k. 


17. 


[We  must  still  include  (d)  in  (4.7)  since  (b)  &  (d)  in  general;  however  (b) 
always  implies  (e2) . ] 

We  may  extend  (4.7)  by  noting  that  a  matrix  A  is  diagonable  if  and  only  if 
all  its  characteristic  roots  are  regular  (cf.  Mirsky,  1955,  page  294,  Theorem 
10.2.3).  The  characteristic  root  A  of  the  matrix  A  is  said  to  be  regular 
whenever  its  geometric  and  algebraic  multiplicities  are  equal  (cf.  Mirsky,  1955, 
page  294,  Definition  10.2.1).  The  algebraic  multiplicity  of  A  is  the  multi¬ 
plicity  of  A  as  a  root  of  the  characteristic  equation;  the  geometric  multi¬ 
plicity  is  the  nullity  of  the  matrix  A  -  Al. 

Let  A  be  n*n  of  rank  r,  and  let  A.  have  rank  r. ,  i  =  l,...,k.  Let 

l  l 

A.,  ...,  A  be  the  nonzero  characteristic  roots  of  A.  Let  m  be  the  alge- 
1  &  ij 

braic  multiplicity  and  g. .  the  geometric  multiplicity  of  A.  as  a  character¬ 
istic  root  of  A^,  so  that,  cf.  Mirsky  (1955,  page  214,  Theorem  7.6.1), 

(4.8)  n  2:  in.  .  >  g_  SO;  i  =  l,...,k,  j  = 


Let  m^j  be  the  algebraic  multiplicity  and  g^.  the  geometric  multiplicity  of 

A.  as  a  characteristic  root  of  A.  Then 
1 


(4.9) 


n  2  V  *  V  s  1;  J-1--*- 


Let  jiUq  be  the  algebraic  multiplicity  and  g  the  geometric  multiplicity  of 
0  as  a  characteristic  root  of  A. ,  i  =  l,...,k.  Then 


£ 

(4.10)  m  =  n  -  mi#  =  n  -  \  m  i  =  l,...,k, 

j  =  l  J 


(4.11)  g^  =  n  -  r^  =  n  -  rank(A.),  i  =  l,...,k. 


18 


Hence 


(4.12) 


n  >  r.  s  m.  >  0;  i  =  l,...,k. 
i  i* 


Let  hIqq  be  the  algebraic  multiplicity  and  let  g  be  the  geometric  multi¬ 
plicity  of  0  as  a  characteristic  root  of  A.  Then 


(4.13) 


moo  =  n  "  V 


=  n  - 


I  •  j 
j-1  03 


(4.14) 


gQ0  =  n  -  r  =  n  -  rank(A) 


Hence 


(4.15) 


n  >  r  >  m  >  0. 

0* 


Let 


(4.16) 


A.  =  B.C!, 
l  ii 


i  =  1 , . . .  ,k. 


be  full  rank  decompositions,  so  that  B  and  C.  are  both  nxr  of  rank  r' 

ii  i  i 

Then 


(4.17) 


A  =  IT  A.  = 

1  l 


E^B.C! 

111 


=  BC\ 


where 


(4.18)  B  =  (B1,  ...,  B  )  and  C  =  (C  ,  . ..,  Cfc) 


are  both  nxl  r,. 

1  i 


19. 


Now  suppose  that 


(b) 


A.  A  =0 

1  j 


for  all  i  j-  j 


holds.  Then  C|B.  =  0  for  all  i  t  j  and  so 
i  3 


(4.19) 


C'B  =  /CJB 


=  C'B  ©  . . .  ©  C’B,  , 

11  k  k 


C’B. 
k  k 


the  direct  sum  of  the  C!B.. 

l  i 

Let  am_.  (A)  denote  the  algebraic  multiplicity  and  let  gnu  (A)  denote  the 
geometric  multiplicity  of  A^  as  a  characteristic  root  of  A.  Then  using  the 
fact  that  the  matrices  FG  and  GF  have  the  same  nonzero  characteristic  roots 
(cf.  Mirsky,  1955,  page  200,  Theorem  7.2.3),  we  may  write  for  j  =  !,...,£ 


(4.20) 


nn  .  =  am. (A)  =  am.(BC')  =  am. (C'B)  =  7  am.(C!B.) 

II  1  -i  t  i  ^  nil 


0j 


i=l 


3  i  1 


=  l  am . (B . C ! )  =  7  m. 

A  3  i  i  i“1  i 3 


i=l 


while 


(4.21) 


k  £ 


”0-  =  A."oj  =  \  =  mi 

j=l  i=l  j=l  i=l 


so  that 


n  -  am  (A)  = 

0 


(4.22) 


E^[n  -  am  (A.)] . 
1  0  i 


20. 


We  now  use  the  result  that  the  matrices  FG  -  I  and  GF  -  I  have  the  same 
nullity,  cf.  Ouellette  (1981,  equation  (4.147)).  Let  v(.)  denote  nullity.  Then 
for  each  j  =  1 , .  . . ,  H 


(4.23) 


g  .  =  gm.(A)  =  V (A  -  A. I)  =  v(BC'  -  A. I)  =  v(C'B  -  A. I) 
0j  3  3  3  3 


k  k 

=  I  V(C!B,  -  A, I  )  =  l  v(B,C!  -  A. I  ) 


•  i  11  3  r- 

1=1  J  l 


i=l 


ii  j  n 


=  JV(Ai-  ^In)  *  l  hy 

i=l 


Hence  when  (b)  holds  all  the  nonzero  characteristic  roots  of  the  A^,  i  =  l,...,k, 
must  be  characteristic  roots  of  A,  and  all  the  nonzero  characteristic  roots  of 
A  must  be  characteristic  roots  of  A.  for  some  i. 

l 


Furthermore,  since  g. .  <  m  from  (4.8)  we  obtain 

iJ  1 3 


(4.24) 


k  k 

=  I  ^  I  ‘jj  =  m 


3oj 


i-1  ^  i  =  l  13 


0j’ 


j  “  1,  ...,&, 


and  so  for  each  j  =  !,...,£ 


(4.25) 


E0j  -  »0,  -  hi  =  V  1  = 


Thus  if  A.  is  a  regular  nonzero  characteristic  root  of  A  then  when  (b)  holds 
A_.  is  also  a  regular  characteristic  root+  of  each  A^,  i  =  l,...,k.  This  does 
not,  however,  hold  true  for  the  0  characteristic  root  of  A,  for  if  k=2  and 


t  Notice  that  g^  =  0  m„  =0;  we  will  then  speak  of  A.  as  a  regular  char¬ 
acteristic  root  of  A.  even  though  A.  does  not  have  A  as  a  root. 

1  1  3 


21. 


(4.26) 


l) 

and  A  = 

r° 

-1) 

0  / 

k  0 

0  / 

then  (b)  holds  and  A  =  0  so  0  is  a  regular  characteristic  root  of  A,  but 
0  is  clearly  not  a  regular  characteristic  root  of  either  A^  or  of  A^.  The 
equation  (4.23)  is  no  longer  valid  when  A  =  0  since  the  matrices  FG  and 
GF  do  not  necessarily  have  the  same  nullity  (or  rank).  However 


(4.27) 


n  -  gmQ(A)  = 


=  r 


yk 

Vi 


=  Z1[n  -  gmQ(Ai)]. 


Now  suppose  that  in  addition 

(d)  EJr(V  =  r(A) 

holds.  Substitution  in  (4.27)  then  yields 

(4.28)  n  -  gmQ(A)  =  E*[ n  -  gm^A.)] 

and  so 

(4.29)  gmQ(A)  =  n  -  Z*[n  -  gm^Aj] 

k  k 

<  n  -  £1[n  -  am^A^]  =  n  -  =  n  -  mQ> 

Hence  if  0  is  a  regular  characteristic  root  of  A  then  0  must  be  a  regular 
characteristic  root  of  A^,  for  all  i  =  l,...,k.  We  have,  therefore,  proved 
the  following  stronger  version  of  Takemura’s  Theorems  4.2  and  4.4  and  our  (4.7) 


22. 


THEOREM  4.1. 

and  let  A  = 


Let  A  ,  .  . . ,  A  be 

-L  K 

k 

E^A.  .  Suppose  that 


nxn  matrices  ,  not  necessarily  symmetric , 


(b) 


A  A.  =  0  for  all  i  j  . 

i  J 


Then  the  set  of  nonzero  characteristic  roots  of  A  coincides  with  the  set  of 
all  the  nonzero  characteristic  roots  of  all  the  A.,  i  =  Furthermore , 

the  nonzero  characteristic  root  h  of  A  is  regular  if  and  only  if  A  is  a 
regular  nonzero  characteristic  root  of  each  A.,  i  = 

If,  in  addition , 


(d)  Zkrank(AJ  =  rank  (A), 

1  i 

then  the  characteristic  root  0  of  A  is  regular  if  and  only  if  0  is  a 
regular  characteristic  root  of  each  A.,  i  =  l,...,k.  Equivalently, 

(4.30)  rank  (A2)  =  rank  (A)  ^  rank  (A2)  =  rank  (A  J,  i  =  l,...,k. 

Makelainen  and  Styan  (1976,  Lemma  2)  have  shown  that  the  zero  characteristic 

2 

root  of  a  matrix  A  is  regular  if  and  only  if  r(A  )  =  r(A) .  Such  a  matrix  A 
is  said  to  have  index  1,  cf.  Ben-Israel  and  Greville  (1974,  page  169),  A  direct 
proof  of  (4.30)  goes  as  follows.  Let 

(4.31)  D  =  A  ®  ...  ©  Ak  and  K  =  (I,  . . . ,  I) » , 

2  2  2 

cf.  (1.3)  and  (1.4)  on  page  2.  Then  under  (b)  DKK'D  =  D  so  that  A  =  K'D  K, 
while  (d)  yields 


23. 


(4.32) 


r(D)  =  r(K'DK)  =  r(K'D)  =  r(DK) 


using  Sylvester's  Law  of  Nullity,  cf.  Anderson  and  Styan  (1980,  page  11A).  Thus 
under  (b)  and  (d)  the  result  (4.30)  becomes 


(4 . 33) 


r(K'D2K) 


r(D) 


if  and  only  if 

(4.34)  r(D2)  =  r(D)  . 

2  ? 

That  (4.33)  =*  (4.34)  follows  at  once  from  r(K'D  K)  ^  r(D  )  £  r(D) .  To  go  the 
other  way  suppose  that  (4.34)  holds.  Then  using  (4.32)  and  the  rank  cancellation 
rules  (2.10)  and  (2.11)  in  Theorem  2  of  Marsaglia  and  Styan  (1974,  page  271), 
we  obtain  r(K'D2K)  =  r(D2K)  =  r(D2),  and  so  (4.34)  =>  (4.33). 

We  see  also  that  if  (b)  and  (d)  hold  then  for  any  positive  integer  p 
(4.32)  implies  that  r(K'D^K)  =  r(D^)  and  so  we  have  proved  that 


THEOREM  4.2.  Let  the  nxn  matrices  A^,  .. 
Theorem  4.1  and  suppose  that  both  (b)  and 
integer  p 


,  A^  and  A  be  defined  as  in 
(d)  hold.  Then  for  any  positive 


(4.35) 


rank (A?)  =  rank  ^  ^ 


Takemura's  Theorem  4.3  (page  16)  is  well  known;  for  a  nice  proof  see 
Mirsky  (1,955,  page  318,  Theorem  10.6.3). 


24. 

We  may  strengthen  Takemura's  Corollary  4.2  (page  20)  by  allowing  the 
polynomial.  P(x)  to  possibly  have  multiple  roots  and  by  not  requiring  that 
P(0)  =  0.  Instead  then  let  P(0)  =  cl  as  on  page  15  of  this  review.  Thus 
from  (4.3)  we  find  that  if  P(A^)  =0,  i  =  l,...,k,  then  under  (b)  = (i) 

(4.36)  P(A)  =  -(k-l)cl. 

Since  (b)  always  implies  (e2)  =  (iv) ,  it  remains  to  consider  when  does 
(b)  imply  (d)  =  (v) .  That  (b)  (d)  in  general  has  already  been  noted 

(top  of  page  17  of  this  review;  see  also  Anderson  and  Styan  (1980,  page  5)). 
Takemura's  Corollary  4.2  shows  that 

(4.37)  (b)  and  Ai  diagonable,  i  =  l,...,k  =»  (d) . 

This  result,  however,  is  implied  by 

(4.38)  (b)  and  rank(A^)  =  rank  (A  J,  i  =  l,...,k  **  (d) , 

which  was  proved  by  Marsaglia  and  Styan  (1974,  page  286,  Theorem  15(1)).  We 

2 

recall  that  rank(A.)  =  rank(A.)  means  that  the  zero  characteristic  root  of 

l  l 

A^  is  regular,  while  A.  diagonable  means  that  all  the  characteristic  roots 
are  regular,  cf.  pages  17  and  22  of  this  review. 


ACKNOWLEDGEMENTS 

This  research  was  supported  in  part  by  the  Natural  Sciences  and  Engineering 
Research  Council  Canada,  Grant  Number  A7274,  and  by  the  Gouvemement  du  Quebec, 
Programme  de  formation  de  chercheurs  et  d' action  concertee,  subvention  no.  EQ-961. 


25. 


REFERENCES 

Anderson,  T.W.,  and  Styan,  George  P.H.  (198°) •  Cochran's  theorem,  rank  additivity, 
and  tripotent  matrices.  Technical  Report  No. 44,  Dept,  of  Statistics,  Stanford 
University.  Revised  version  published  in  Statistics  and  Probability :  Essays 
in  Honor  of  C.R.  Rao  (G.  Kallianpur,  P.R.  Krishnaiah,  §  J.K.  Ghosh,  eds .), 
North-Holland,  Amsterdam,  1982,  pp.  1-23. 

Ben^Israel,  Adi,  and  Greville,  Thomas  N.E.  (1974).  Generalized  Inverses:  Theory 
and  Applications.  Wiley,  New  York.  Reprinted  with  corrections,  Robert  E. 
Krieger,  Huntington,  New  York,  1980. 

Luther,  Norman  Y.  (1965) .  Decomposition  of  symmetric  matrices  and  distributions 
of  quadratic  forms.  The  Annals  of  Mathematical  Statistics,  36,  683-690. 

Makelainen,  Timo,  and  Styan,  George  P.H.  (1976).  A  decomposition  of  an  idempotent 
matrix  where  nonnegativity  implies  idempotence  and  none  of  the  matrices  need 
be  symmetric.  Sankhyd  Series  A,  38,  400-403. 

Marcus,  Marvin,  and  Mine,  Henryk  (1964).  A  Survey  of  Matrix  Theory  and  Matrix 
Inequalities.  Allyn  and  Bacon,  and  Prindle,  Weber  and  Schmidt,  Boston. 

Marsaglia,  George  (1967) .  Bounds  on  the  rank  of  the  sum  of  matrices.  Transactions 
of  the  Fourth  Prague  Conference  on  Information  Theory,  Statistical  Decision 
Functions,  and  Random  Processes  (Prague,  August  31  -  September  11,  1965), 
Czechoslovak  Academy  of  Sciences,  455-462. 

Marsaglia,  George,  and  Styan,  George  P.H.  (1974)  .  Equalities  and  inequalities  for 
ranks  of  matrices.  Linear  and  Multilinear  Algebra,  2,  269-292. 

Mirsky,  L.  (1955).  An  Introduction  to  Linear  Algebra.  Oxford  University  Press. 

Ouellette,  Diane  Valerie  (1981).  Schur  complements  and  statistics.  Linear  Algebra 
and  its  Applications  (Letters  in  Linear  Algebra),  36,  187^295.  [M.Sc.  thesis. 
Dept,  of  Mathematics,  McGill  University,  MontrSal,  July  1978.] 

Rao,  C.  Radhakrishna  (1973).  Linear  Statistical  Inference  and  Its  Applications. 
Second  Edition.  Wiley,  New  York. 


26. 


Rao,  C.  Radhakrishna,  and  Mitra,  Sujit  Kumar  (1971).  Generalized  Inverse  of 
Matrices  and  Its  Applications .  Wiley,  New  York. 

Rao,  C.  Radhakrishna,  and  Yanai,  Haruo  (1979).  General  definition  and  decom¬ 
position  of  projectors  and  some  applications  to  statistical  problems. 

Journal  of  Statistical  Planning  and  Inference ,  3,  1-17. 

Styan,  George  P.H.  (1981).  Review  of  Rao  and  Yanai  (1979).  Mathematical 
Reviews ,  in  press. 

Takemura,  Akimichi  (1980).  On  generalizations  of  Cochran's  theorem  and 
projection  matrices.  Technical  Report  No. 44,  Dept,  of  Statistics, 
Stanford  University. 


GPHS  :  01.01.81 
Updated:  09.07.82 


UNCLASSIFIED 


SECURITY  CLASSIFICATION  OF  THIS  PAGE  (When  Date  Entered) 


REPORT  DOCUMENTATION  PAGE 

READ  INSTRUCTIONS 

BEFORE  COMPLETING  FORM 

1.  REPORT  NUMBER  2.  GOVT  ACCESSION  NO. 

2 

3.  RECIPIENT’S  CATALOG  NUMBER 

4.  TITLE  (and  Subtitle) 

A  Review  and  Some  Extensions  of  Takemura's 
Generalizations  of  Cochran's  Theorem 

5.  TYPE  OF  REPORT  4  PERIOD  COVERED 

Technical  Report 

6.  PERFORMING  ORG.  REPORT  NUMBER 

7.  AUTHORfs) 

George  P.H.  Styan 

8.  CONTRACT  OR  GRANT  NUMBERfs) 

DAAG29-82-K-0156 

9.  PERFORMING  ORGANIZATION  NAME  AND  ADDRESS 

Department  of  Statistics  -  Sequoia  Hall 

Stanford  University 

Stanford,  California  94305 

10.  PROGRAM  ELEMENT.  PROJECT,  TASK 
AREA  a  WORK  UNIT  NUMBERS 

P-19065-M 

11.  CONTROLUfGJpF^N^^  Wf|^SS 

Post  Office  Box  12211 

Research  Triangle  Park,  NC  27709 

12.  REPORT  DATE 

September  1982 

13.  NUMBER  OF  PAGES 

26 

14.  MONITORING  AGENCY  NAME  4  ADDRESSfi/  different  from  Controlling  Ofllce) 

15.  SECURITY  CLASS,  (of  this  report) 

UNCLASSIFIED 

15e.  declassification/  down  grading 
schedule 

16.  DISTRIBUTION  STATEMENT  (of  thle  Report) 

uSS  f0r  PUbliC  distribution 

17.  DISTRIBUTION  STATEMENT  (ot  the  ebetrect  entered  in  Block  20,  if  different  from  Report) 

Also  issued  as  Technical  Report  No.  56  under  Office  of  Naval  Research  Contract 
No.  N00014-75-C-0442. 

18.  SUPPLEMENTARY  NOTES 

OP'N,ONS-  ANC/°R  findings  contained  in  this  report 

ARE  THOSc  OF  THc:  AUTHOR(S)  AND  SHOULD  ’T”  de  CONSTRUED  aq 

AN  OFFICIAL  DEPARTMENT  OF  THE  ARMY  POSITION,  pJt  ICY  OR  DF 

CISION,  UNLESS  SO  DESIGNATED  BY  OTHER  DOCUMENTATION 

19.  KEY  WORDS  (Continue  on  reveree  aide  it  necessary  end  identify  by  block  number) 

Cochran's  theorem,  rank  additivity,  r-potent  matrices,  direct  sum,  projection 
matrices,  diagonability,  matrix  polynomial. 

2Q.  ABSTRACT  (Continue  ea  reverse  side  1 1  necessary  and  identify  by  block  number) 

Generalizations  of  Cochran's  theorem  by  Takemura  are  discussed  in  con¬ 
nection  with  results  by  Anderson  and  Styan  and  other  works  by  Rao  and  Yanai 
and  Marsaglia  and  Styan.  A  number  of  further  extensions  are  presented. 

DD 


FORM 
!  JAN  73 


1473 


EDITION  OF  *  NOV  65  IS  OBSOLETE 


UNCLASSIFIED 


SECURITY  CLASSIFICATION  OF  THIS  PAGE  (When  Dete  Entered) 


